052617.1140 HOUSTON 213434 v4 (P00-2968 ESSG-SPD) 



APPLICATION FOR PATENT 



TITLE: 



UNIVERSAL ASYNCHRONOUS BOUNDARY MODULE 



INVENTORS 



PARAS A. SHAH AND PRASHANTHA KALLURAYA 



SPECIFICATION 



CROSS-REFERENCE TO RELATED APPLICATIONS 



[0001] 



None. 



STATEMENTS REGARDING FEDERALLY 
SPONSORED RESEARCH OR DEVELOPMENT 



[0002] Not Applicable. 

REFERENCE TO A MICROFICHE APPENDIX 
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BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

[0004] The present invention generally relates to application specific integrated circuits 
(ASIC) and to a method for communications, and more particularly, to a logic device for 
facilitating an interface between logic circuits operating at two differing clock domains. 

2. Description of the Related Art 

[0005] Typically, computer hardware includes various data buses for transmitting data 
from one device to another. For example, a personal computer (PC) may have a main bus, a 
local bus, a video bus, etc. A computer server typically includes multiple microprocessors 
and data buses. Should a first device (e.g., a processor) on one bus need to communicate to a 
second device (e.g., a graphic's adapter) on another bus, an interface, such as a bridge, is 
employed to transfer the data seamlessly from one bus to another, and vice versa, if needed. 
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[0006] At times, the various computer buses may operate at different clock rates. For 
example, one of the local buses, such as a PCI (and its derivatives, PCI-X) may operate at 66 
MHz, 100 MHz or 133 MHz and a legacy local bus, such as an ISA bus may operate at 8 
MHz. Another example includes buses within an ASIC chip, such as an Inter Module Bus 
(1MB), which may operate at a higher frequency than a PCI (or PCI-X) bus, such as 200 
MHz. 

[0007] A prior method for coupling logic circuits operating at different frequencies (e.g., 
a first logic circuit operating at 100 MHz coupled to a second logic circuit operating at 50 
MHz) includes an interface that would create a stretched version of a signal from the first 
logic circuit that was long enough to be detected by the second logic circuit. That is, the first 
.logic circuit would send signals that are 10 nanoseconds long. But since the second logic 
circuit operates at 50 MHz, the second logic circuit would generally not be able to detect the 
signals since the second logic circuit would typically be able to detect signals that are 20 
nanoseconds or longer. Consequently, the prior method would stretch a signal in time by 
resending the signal (signals transmitted in consecutive clock cycles). Thus, this prior 
method requires knowledge of the clock frequency domains for each logic circuit so that the 
interface can be designed to provide the necessary signal between the logic circuit. In 
addition, this prior method typically utilized logic that is inflexible to changing 
circumstances. That is, prior interfaces were designed to operate between particular logic 
circuit operating at known particular frequencies. Consequently, these interfaces generally 
could not be used with logic circuits operating at other frequencies that they were not 
designed for. 

[0008] In addition, the prior method is typically limited by a frequency ratio. As 
mentioned above, interfaces used in prior methods are generally designed to handle a 
particular frequency ratio, e.g., 3:1. Thus, the frequency domain on one side could not be 
greater than 3 times the frequency of the other side. Furthermore, the prior method typically 
requires more logic to implement, thus requiring more precious silicon real estate. 



2 



1 

052617.1140 HOUSTON 213434 v4 (POO-2968 ESSG-SPD) 



BRIEF SUMMARY OF THE INVENTION 

[0009] The present invention has broad applicability to situations where data is being 
communicated between two different frequency domains. However, the present invention 
will be described in the context of communications within a computer PC or server. 

[0010] An interface, such as a Universal Asynchronous Boundary Module (UABM) is 
employed between logic circuits that operate at different clock or frequency domains. The 
UABM samples an input data stream from a first logic circuit and the UABM transmits the 
same data at a clock rate equal to the clock domain of a second logic circuit. In one 
* embodiment, the UABM detects signal level changes (logic state changes) in the input data 
stream from the first logic circuit at a first clock rate, and upon an assertion (e.g., a pulse) in 
the input data stream, the UABM inverts the logic state of an intermediate signal. The 
intermediate signal then propagates through the UABM at a second clock rate. When the 
level change is detected in the intermediate signal, an asserted signal is outputted to the 
second logic circuit at the second clock rate, thus providing the input data stream at the first 
clock rate to the second logic circuit at the second clock rate. 

[0011] Another embodiment includes an assertion of a mask signal. Should either logic 
circuit be reset, a signal is applied to mask the output of the UABM to prevent erroneous data 
from being transmitted to the second logic circuit. 

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS 

[0012] A better understanding of the present invention can be obtained with the following 
detailed description of the preferred embodiment is considered in conjunction with the 
following drawings, in which: 

Figure 1 is a computer system employing a single processor in which the techniques 
of this invention can be implemented; 

Figure 2 is a computer system utilizing multiple processors in which the techniques of 
this invention can be implemented; 

Figure 3 is a logic diagram of a universal asynchronous boundary module; 

Figure 4 is a timing diagram wherein the input logic circuit is reset; 

Figure 5 is a timing diagram wherein the output logic circuit is reset; 
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Figure 6 is a timing diagram wherein the input logic circuit operates at a higher clock 
rate than the output logic circuit; 

Figure 7 is a timing diagram wherein the input logic circuit operates at a lower clock 
rate than the output logic circuit; and 

Figure 8 is a flow chart illustrating an exemplary technique of interfacing logic 
circuits that operate at different clock rates. 

DETAILED DESCRIPTION OF THE INVENTION 

[0013] Turning to the drawings, Figure 1 illustrates a computer 10 having multiple buses, 
including a CPU bus, a mezzanine or PCI bus, and a peripheral bus or ISA/EISA bus. The 
CPU bus connects a CPU or processor 12 to a bus interface unit or north bridge 14. A cache 
memory 16 can be embodied within or external to CPU 12. 

[0014] North bridge 14 provides an interface between components clocked at dissimilar 
rates. According to a typical system, the north bridge 14 interfaces a slower PCI bus and a 
faster CPU bus. The north bridge 14 may also contain a memory controller that allows 
communication to and from system memory 18. A suitable system memory 18 comprises 
DRAM or preferably SDRAM. The north bridge 14 may also include a graphics port to 
allow connection to a graphics accelerator 20. A graphics port, such as the Accelerated 
Graphics Port ("AGP") provides a high performance, component level interconnect targeted 
at three dimensional graphic display applications and is based on performance extensions or 
enhancements to PCI. AGP protocol is generally standard in the industry, the description of 
which is available from Intel Corporation. 

[0015] The north bridge 14 is generally considered an application specific chip set, or 
application specific integrated circuit ("ASIC") that provides connectivity to various buses, 
and integrates other system functions such as a memory interface. System memory 18 is 
considered the main memory and refers to a portion of addressable memory that the majority 
of memory accesses target. The system memory 18 is accessed via the north bridge 14. 

[0016] Unlike the CPU bus, which runs at speeds comparable to CPU 12, the PCI bus 
generally runs at speeds of 33 MHz. A south bridge 28 is coupled between the PCI bus and 
the peripheral bus. Similar to the north bridge 14, the south bridge 28 is an ASIC or group of 
ASICs that provide connectivity between various buses, and may also include system 
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functions that can possibly integrate one or more serial ports. Attributable to the PCI bus are 
input/output ("I/O") devices 30 which require higher speed operation than I/O devices 32. 

[0017] Figure 2 illustrates a computer system 200 having multiple processors 202-216. 
The processors 202-216 are coupled to processor bus address/data controllers 218, 220 and 
222. The processors 202-216 are not limited to a particular type, but in one embodiment, the 
processors are Intel IA-64 microprocessors (Intel Corp. of Santa Clara, CA). 

[0018] The computer system 200 can also include various PCI-X expansion cards such as 
a SCSI controller 224, a network controller 228, an interrupt controller 230 and a server 
management chip 232. The computer system 200 can also include memory controllers 242- 
250 for the processors 204-216 to access memory, such as RAM. The PCI-X expansion 
cards are coupled to bridges, which may include universal asynchronous boundary modules 
(UABM) 234 and 236. Communications between the processors 202-216 are done via a data 
bus. To ensure high data rates, the data bus is coupled to an I/O bridge 238. 

[0019] Next, the I/O bridge 238 is coupled to a high-speed Inter-Module Bus (1MB) 240. 
In certain computer system configurations, the 1MB and the PCI-X bus operate at different 
clock rates. For example, the 1MB 240 in one embodiment can operate at 200 MHz and a 
PCI-X bus can operate at 33 MHz. Thus, the UABMs 234 and 236 within the bridges couple 
the 1MB and PCI-X buses and provide the interface between the two different data buses, so 
that data from one logic circuit can communicate with another logic circuit operating at a 
different clock rate. 

[0020] The above description is illustrative only. As mentioned previously, the invention 
can be implemented in other applications, unrelated to computer systems. For example, the 
invention can be implemented in a cellular telephone, such as a telecommunication ASIC 
chip operating at a first clock rate coupled to a microcontroller operating at a second clock 
rate. 

[0021] Figure 3 illustrates a logic diagram of one embodiment of a UABM implemented 
employing the techniques according to the invention. As mentioned previously, the UABM 
can be implemented (e.g., within a bridge) to couple data buses, such as PCI to ISA, or PCI to 
1MB. The input to the UABM 300 is to an input logic block 318, wherein the input to the 
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input logic block 318 is identified by the signal line INPUT PULSE to a multiplexer 302. 
The input logic block 318 includes the multiplexer 302 and a memory device or register, such 
as a D-type flip-flop 304. The INPUT_PULSE signal may originate from another input logic 
circuit (not shown) that is coupled to a data bus, such as an 1MB bus. Depending on whether 
the INPUT PULSE has a logic 0 or 1 value, the multiplexer 302 provides a logic 0 or 1 to the 
register 304. 

[0022] The register 304 is clocked by an INPUT CLOCK signal. The INPUT_CLOCK 
signal can also be used to clock the devices in the input logic circuit (not shown). The 
register 304 can also be reset by an rNPUT_RESET_ signal (for signal convention a 
SIGNAL_ refers to an active low, i.e., logic 0, for an asserted signal). Inputs to the 

W multiplexer 302 are provided by the Q and Q outputs of the register 304. The multiplexer 

302 outputs the Q (an inverted Q signal) signal to the D input of the register 304 when the 
fU INPUTPULSE signal to the multiplexor 302 is asserted (i.e., a pulse is received). Thus, if 
y, the INPUT PULSE signal is deasserted, the multiplexor 302 provides the same signal (from 
l"i the Q output of the register 304) to the D input of the register 304, and the Q output remains 
U unchanged at the next INPUT_CLOCK clock cycle. 

[0023] If the INPUTJPULSE signal is a pulse (an assertion), then the multiplexer 302 

output switches to its S2 input, which is coupled to the Q output of the register 304. This 
inverts the Q output of the register 304 in the next clock cycle. As noted before, the register 
304 is clocked by the INPUT_CLOCK signal. Conversely, if the INPUT_PULSE signal is 
not a pulse (i.e., a deasserted signal), then the multiplexor 302 switches to its Si input, which 
is coupled to the Q output of the register 304 and provides the signal Q to the D input of the 
register 304. Therefore, the register 304 would continue to clock the same data until the 
multiplexer 302 would receive an asserted INPUTPULSE signal. A detailed view of the 
signal states of the UABM 300 will be provided in later figures. 

[0024] Registers 306-308 are coupled to the register 304, and are preferably provided to 
address metastability issues. The registers 306-308 are clocked by the clock of a second logic 
circuit (not shown), identified as an OUTPUT_CLOCK signal. As mentioned previously, the 
second logic circuit may be composed of PCI devices operating at a different clock rate than 
the first logic circuit. 
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[0025] To address the issue of metastability, should the clock rates for the 
INPUT_CLOCK and OUTPUT CLOCK be different, the additional registers 306-308 are 
added in the UABM 300. The registers 306-308 (or metastability registers) can be any type 
of memory device, such as D-type flip-flops. The number of registers used to address 
metastability issues is not necessarily limited by the number of registers shown in the figure. 
One solution for addressing metastability issues is to register the signal a second, and 
possibly a third time, using the plurality of D-type flip-flops. In one embodiment, two (2) 
metastability registers 306 and 308 are utilized when one of the logic circuit operates at 200 
MHz. 

[0026] Thus, at each OUTPUTCLOCK clock cycle, data (or for the purposes of clarity, 
an intermediate signal) propagates through the registers 306-308 at the clock rate of the 
OUTPUT_CLOCK signal rate. The Q output of the metastability register 308 is provided to 
the D-input of a register 310. The registers 310 and 312 are also clocked by the 
OUTPUT_CLOCK signal. The signal from the Q output of register 310 is provided to the D 
input of a register 312. The registers 310 and 312 can be any type of memory device, such as 
a D-type flip-flop. The registers 310 and 312 along with an XOR gate 314 comprise an 
output logic block 320 of the UAMB 300. When the OUTPUT_CLOCK signal provides a 
clock pulse to the register 312 clock input, the intermediate signal is clocked out of the 
register 312 to the XOR gate 314. Thus, when the D input of the register 312 is different 
from its Q output, the XOR gate 314 provides a logic 1 to an AND gate 316. Thus, as an 
assertion in the INPUT^PULSE signal is received by the input logic block 318, the output 
logic block 320 provides the same signal, however, at the clock rate provided by the 
OUTPUT_CLOCK signal. This is because the OUTPUT_CLOCK signal propagates the 
intermediate signal through the final register 312 in one clock cycle, so the inputs to the XOR 
gate 3 14 are thus different for that single clock cycle. 

[0027] To ensure signal integrity as a result of one logic circuit resetting while the other 
logic circuit does not, the output signal of the output logic block (XOR gate 314) is ANDed 
with an OUTPUT_MASK_ signal. For example, it may be necessary to reset one logic 
circuit as a result of a change in clock frequencies, as denoted by the INPUTCLOCK signal. 
One such scenario is when a PCI card is operating at 66 MHz. The computer system may 
allow hotplugging of PCI cards. Thus, if one PCI card is removed and replaced with another 
PCI card that operates at a different frequency, such as a 100 MHz, the computer system will 
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reset this logic circuit. However, it is possible that the rest of the computer system will not be 
reset. Thus, during this transition, should one of the logic circuits reset and not the other, a 
deasserted OUTPUT_MASK_ signal is provided to the AND gate 316 so that the 
OUTPUT_PULSE signal is not erroneous due to the reset of one of the logic circuits. 

[0028] Figure 4 is a timing diagram wherein an input logic circuit is reset. At timing 
reference 400, the INPUTJPULSE signal is asserted (e.g., a pulse). The multiplexer 302 

provides the Q output of the register 304 to the D input of the register 304. Therefore, the 
logic state of the Q output of the register 304 inverts (for illustrative purposes, the Q output of 
the register 304 transitions from logic 0 to logic 1 at time reference 400, as denoted by the 
Q_I1 signal) after one clock cycle of the INPUT_CLOCK. After propagating through the 
metastability registers 306 and 308, the OUTPUT_PULSE signal is asserted at timing 
reference 402 (since the inverted signal is detected). Should the input logic circuit be reset, 
the register 304 would also be reset (the Q output of register 304 transitions to logic 0 and the 

Q output of the register transitions to logic 1). 

[0029] For example, at timing reference 404, an INPUT_RESET_ signal is asserted (in 
this case, an assertion is an active low, or logic 0). Thus, the Q output of the register 304 is 
reset to logic 0. However, since the metastability registers 306-308 and output registers 310 
and 312 are unaffected by the 1NPUT_RESET_ signal, the Q output of register 304 will 
propagate through the registers 306-312 at a clock rate equal to the second clock 
(OUTPUT_CLOCK). Consequently, once the logic 0 signal of the Q output of the register 
304 propagates to the Q output of the register 310, an OUTPUT_PULSE asserted signal 
would be provided. Therefore, to mask this erroneous pulse, an OUTPUT_MASK_ signal is 
asserted, sometime after the INPUT_RESET_ signal is asserted, at timing reference 406. 
Referring back to Figure 3, the output of the XOR gate 314 and the OUTPUT_MASK_ signal 
are ANDed at AND gate 316. Since the OUTPUT_MASK_ signal is asserted (logic 0) the 
OUTPUTPULSE signal does not include the erroneous pulse 410. 

[0030] Figure 5 is a timing diagram wherein an output logic circuit is reset. At timing 
reference 500, the INPUTPULSE signal is asserted (e.g., a pulse). The multiplexer 302 

provides the Q output of the register 304 to the D input of the register 304. Consequently, 
the register 304 provides an inverted signal at the next INPUT_CLOCK clock cycle. After 
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the signal propagates through the metastability registers (e.g., 306 and 308), the output logic 
block 320 provides a pulse (in the OUTPUT_CLOCK domain) at timing reference 502. 
Should the output logic circuit be reset, an OUTPUT_RESET_ signal is asserted at timing 
reference 504. Once the OUTPUT_RESET_ signal is asserted, the metastability registers 306 
and 308 and output logic block 320 registers 310 and 312 are also reset (i.e., the Q outputs of 
these registers transition to logic 0). Consequently, when the registers 306-312 come out of 
reset (see timing reference 508), it is possible that the output logic block 320 would produce 
an erroneous pulse 512 at timing reference 510. Therefore, to prevent the erroneous pulse 
512 from being generated, the output of the logic block 320 is ANDed with an 
OUTPUT_MASK_ signal at AND gate 316 to mask the OUTPUT_PULSE signal. 

[0031] Figure 6 is a timing diagram wherein the input logic circuit operates at a higher 
clock rate than the output logic circuit. As noted in the timing diagram, the INPUT CLOCK 
signal has a higher data rate than the OUTPUT CLOCK signal. Furthermore, POINT A, 
POINT_B, POINT_C and POINT_D signals refer to the logic state of the signals at the 
locations designated by A, B, C, and D, as shown in Figure 3. The signal denoted by 
RESET_ denotes the INPUT_RESET_ and OUTPUT_RESET_ signals of Figure 3. 
(RESET_ represents that a signal is active low, i.e., asserted with a logic 0). 

[0032] At a time reference 600, the RESET_ signal is deasserted. The INPUT_PULSE is 
at logic 0 during this time. At the next clock cycle of the INPUTCLOCK signal, the logic 0 
propagates through the register 304, as denoted by the logic 0 at POINT B. For clarity 
purposes, the logic state of POINT B refers to the initial logic state of an intermediate signal. 
The intermediate signal traverses through the registers at each output logic circuit clock cycle 
as denoted by POTNT_C and POINT_D. Since the logic states of POINT_C and POINT_D 
are logic 0, the OUTPUT_PULSE signal is also a logic 0 at each instance of the 
OUTPUT_CLOCK cycle. 

[0033] Thereafter, at time reference 604 the INPUT_PULSE signal changes to a logic 1, 
causing the multiplexer 302 to also provide a logic 1 to the D input of the register 304 as 
denoted by POINT_A. At the next clock cycle of the TNPUT_CLOCK, the register 304 
provides the logic 1 at its Q output as denoted by POINTJB. The intermediate signal 
traverses metastability registers (note, the timing figure takes into account two (2) 
metastability registers) at the OUTPUTCLOCK rate. Thus, at two clock cycles of the 
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OUTPUT_CLOCK signal, a logic 1 is provided to the D input of the register 312. Since, the 
register 312 is being clocked by the OUTPUT_CLOCK signal, the Q output of the register 
changes to a logic 1. Since POINT C and POINT D are logically different (as shown at time 
reference 610), the output of the XOR gate 314 transitions to a logic 1 as denoted at time 
reference 610. Once POINTD transitions to a logic 1, the output of the XOR gate 314 
transitions to a logic 0 (as shown at 612). The OUTPUT_PULSE remains at a logic 0 until 
the INPUTPULSE transitions from a logic 0 to a logic 1 (as shown at 614). The time 
references 614, 616, 618, 620, and 622 show a second INPUT_PULSE that causes an 
OUTPUT_PULSE assertion. The behavior is identical to the above description, except the 
initial logic state of the intermediate signal (POINT_B) is now 1 instead of 0. 

[0034] Figure 7 illustrates a timing diagram where the input logic block operates at a 
lower clock rate than the output logic block. As shown in the figure, the INPUT_PULSE 
signal transitions to a logic 1 for one clock cycle commencing at 700. Sometime thereafter 
(timing reference 702), POINT_A transitions to a logic 1 as a result of the multiplexer 302. 
Once again, for clarity purposes, the logic state of POINTB refers to the initial logic state of 
an intermediate signal. The output of the register 304 transitions to a logic 1 after one clock 
cycle of the INPUTCLOCK (as shown in 704). The intermediate signal is clocked through 
the metastability registers at the OUTPUT CLOCK rate. Thus, at 706, POINTC transitions 
to a logic 1. Since POINT_D was previously at a logic 0, the output of the XOR gate 314 
transitions to a logic 1. Since the register 312 will clock the data at the OUTPUT_CLOCK 
rate, at 708, POINT_C and POINT_D are equal, thus the OUTPUT_PULSE signal remains at 
a logic signal 0 until the INPUTJPULSE signal transitions to a logic 1 (i.e., the 
INPUT_PULSE signal is a pulse, as shown at 710). The time references 710, 712, 714, 716, 
and 718 show a second INPUT_PULSE that causes an OUTPUT_PULSE assertion. The 
behavior is identical to the above description, except the initial state of the intermediate 
signal (POINT_B) is now 1 instead of 0. 

[0035] Figure 8 is a flow chart illustrating exemplary technique of interfacing logic 
circuits that operate at different clock rates. The method starts at step 800. An input logic 
block receives an input data stream from a first logic circuit at step 802. At step 804, an 
asserted signal of the input data stream is ascertained at a first clock rate. At step 806, if the 
input data stream is an asserted signal, the method proceeds to step 808. If the input data 
stream is not an asserted signal, then the method proceeds to step 807. At step 807, the logic 
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state of an intermediate signal is not inverted. The method then proceeds to step 810 where 
the intermediate signal is propagated to an output logic block, which runs at a second clock 
rate. 

[0036] At step 808, the logic state of an intermediate signal is inverted. Next, at step 810, 
the intermediate signal is propagated to an output logic block, which runs at a second clock 
rate. At step 812, the intermediate signal is compared at time To and T_i (the time difference 
between T 0 and T_i is one clock period of the second clock rate). At step 814, a 
determination is made to whether the logic states of the intermediate signal at To and T_i are 
different. If the logic states of the intermediate signal at T 0 and T-i are not different, the 
method proceeds to step 815. At step 815, the output logic block provides a deasserted signal 
at a second clock rate to a second logic circuit. Next, the method ends at step 822. 

[0037] At step 814, if the logic states of the intermediate signal at T 0 and T.i are different, 
an asserted signal is outputted at step 816. Next, a determination is made at step 818, to 
ascertain whether the first logic circuit or the second logic circuit has been reset. If either the 
first logic circuit or the second logic circuit has been reset, a signal is applied to mask the 
output at step 824. The method ends as step 826. If neither the first logic circuit nor the 
second logic circuit have been reset, the asserted signal output is provided to a second logic 
circuit at step 820. The method ends at step 822. 

[0038] As mentioned previously, alternatives to the various devices described above may 
be implemented. For example, the figures described above include flip-flops, multiplexors, 
and XOR and AND gates. These logic devices could be implemented instead in 
programmable logic. 

[0039] The foregoing disclosure and description of the invention are illustrative and 
explanatory thereof, and various changes in the size, shape, materials, components, circuit 
elements, wiring connections and contacts, as well as in the details of the illustrative circuitry 
and construction and method of operation may be made without departing from the spirit of 
the invention. 
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